An Efficient BSP Computation Technique without Global Barrier Synchronization

Jinsoo Kim (김진수); Soonhoi Ha (하순희); Chu Shik Jhon (전주식)

Journal of the Korean Information Science Society A: Systems and Theory (정보과학회 논문지 A : 시스템 및 이론)

Korean Title	전역적인 배리어 동기화를 수행하지 않는 효율적인 BSP 연산 기법
English Title	An Efficient BSP Computation without Global Barrier Synchronization
Author	Jinsoo Kim (김진수), Soonhoi Ha (하순희), Chu Shik Jhon (전주식)
Citation	Vol. 25, No. 7, pp. 655-667 (July 1998)
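Korean Abstract (English translation)	The BSP (Bulk Synchronous Parallel) model of computation is very useful for developing efficient and highly portable parallel programs on a variety of parallel architectures. However, the barrier synchronization used in the BSP model incurs a relatively large overhead on message-passing architectures. This paper therefore proposes a relaxed barrier synchronization technique that relaxes the barrier synchronization condition of the BSP model so that the model can be implemented efficiently on message-passing architectures. Unlike existing methods, the proposed technique performs no global barrier synchronization; a processor synchronizes with the sending processor only when it references non-local data. As a result, processors may execute different supersteps depending on their relative execution speed and synchronization requirements, and the relaxed barrier synchronization technique guarantees the consistency of data accesses among them. Experiments on an IBM SP2 with 32 processors show that, compared with the existing implementation, synchronization time is reduced by 45.2% to 61.5% for FT and by 28.6% to 49.0% for LU.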
English Abstract	The Bulk Synchronous Parallel (BSP) model of computation can be used to develop efficient and portable programs for a range of machines and applications. However, the barrier synchronization used in the BSP model is relatively expensive on message-passing architectures. In this paper, we relax the barrier synchronization constraint of the BSP model to enable an efficient implementation on message-passing architectures. In our relaxed barrier synchronization, synchronization occurs only when non-local data is accessed, and only between the producer and consumer processors, which eliminates the exchange of global information. Because processors are not globally synchronized, each processor may execute a different superstep according to its relative speed and its synchronization requirements. In experimental evaluations on an IBM SP2 with 32 processors, we observed that the relaxed barrier synchronization reduces the average synchronization time by 45.2% to 61.5% in FT and by 28.6% to 49.0% in LU.
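
The idea summarized in the abstract, synchronizing only between a producer and a consumer when non-local data is read, can be illustrated with a small simulation. The following is a minimal Python sketch and not the authors' implementation: threads stand in for processors, and `Mailbox`, `run_ring`, and the ring-exchange workload are hypothetical names invented for this example. It assumes that each message is tagged with the superstep in which it was sent, so that a fast processor can run ahead without overwriting data a slower neighbor has not yet read.

```python
import threading


class Mailbox:
    """Hypothetical per-processor inbox keyed by (sender, superstep)."""

    def __init__(self):
        self._cond = threading.Condition()
        self._slots = {}

    def put(self, sender, superstep, value):
        # Deliver a value tagged with the superstep in which it was sent.
        with self._cond:
            self._slots[(sender, superstep)] = value
            self._cond.notify_all()

    def get(self, sender, superstep):
        # Block only until this one producer has delivered its data for
        # the requested superstep; no other processor is involved.
        with self._cond:
            self._cond.wait_for(lambda: (sender, superstep) in self._slots)
            return self._slots.pop((sender, superstep))


def run_ring(num_procs=4, num_supersteps=3):
    """In every superstep, each processor adds its left neighbor's value."""
    boxes = [Mailbox() for _ in range(num_procs)]
    results = [None] * num_procs

    def worker(pid):
        value = pid
        left = (pid - 1) % num_procs
        right = (pid + 1) % num_procs
        for step in range(num_supersteps):
            # Send to the right neighbor and continue; no global barrier.
            boxes[right].put(pid, step, value)
            # Reading non-local data is the only synchronization point.
            value += boxes[pid].get(left, step)
        results[pid] = value

    threads = [threading.Thread(target=worker, args=(p,)) for p in range(num_procs)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

Calling `run_ring(4, 3)` returns `[16, 12, 8, 12]`, the same values a globally synchronized version would produce, even though the threads may be executing different supersteps at any given moment.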